Multimodal language processing

Author

Michael Johnston

Abstract

Multimodal interfaces enable more natural and effective human-computer interaction by providing multiple channels through which input or output may pass. In order to realize their full potential, they need to support not just input from multiple modes, but synchronized integration of semantic content from different modes. This paper describes a multimodal language processing architecture which allows for declarative statement of multimodal integration strategies in a unification-based grammar formalism. The architecture is currently deployed in a working system enabling interaction with dynamic maps using speech and pen, but the approach is more general and supports a wide variety of other potential multimodal interfaces.
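To make the idea of unification-based integration concrete, the following is a minimal sketch and not the paper's implementation. It assumes that the interpretations of speech and pen input can be modeled as nested Python dictionaries (feature structures) and that integration succeeds only when the two structures are compatible. The feature names (`type`, `action`, `location`, `coords`) and the `unify` helper are hypothetical, chosen for illustration; typed feature hierarchies, shared variables, and timing constraints are left out.

```python
class UnificationError(Exception):
    """Raised when two feature structures carry conflicting values."""


def unify(a, b):
    """Recursively merge two feature structures.

    Dictionaries are merged feature by feature; atomic values must be equal.
    Illustrative simplification: no types, variables, or temporal constraints.
    """
    if isinstance(a, dict) and isinstance(b, dict):
        result = dict(a)  # copy so the inputs are not modified
        for feature, b_value in b.items():
            if feature in result:
                result[feature] = unify(result[feature], b_value)
            else:
                result[feature] = b_value
        return result
    if a == b:
        return a
    raise UnificationError(f"cannot unify {a!r} with {b!r}")


# Hypothetical interpretation of a spoken command such as "zoom in here":
# the location is required to be a point but is otherwise unspecified.
speech = {"type": "command", "action": "zoom", "location": {"type": "point"}}

# Hypothetical interpretation of a pen tap on the map.
pen_tap = {"location": {"type": "point", "coords": (120, 45)}}

# Hypothetical interpretation of a circled region, which clashes with "point".
pen_circle = {"location": {"type": "area"}}

combined = unify(speech, pen_tap)

try:
    unify(speech, pen_circle)
    clash_detected = False
except UnificationError:
    clash_detected = True
```

In this sketch the spoken command leaves the location underspecified and the pen tap supplies the coordinates, so `combined` carries content from both modes. The circled region is rejected because its `type` conflicts with what the speech interpretation requires.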


Similar resources

Language Technology – a Survey of the State of the Art Language Resources – Multimodal Language Resources

This article provides an overview of research in multimodal language processing and associated resources. It defines multimodal processing, describes key challenges, identifies potential benefits, and outlines the major tasks, including multimodal input interpretation, multimodal output generation, and multimodal information access. The article exemplifies the state of the art in multimedia and...


Achieving Multimodal Cohesion during Intercultural Conversations

How do English as a lingua franca (ELF) speakers achieve multimodal cohesion on the basis of their specific interests and cultural backgrounds? From a dialogic and collaborative view of communication, this study focuses on how verbal and nonverbal modes cohere together during intercultural conversations. The data include approximately 160-minute transcribed video recordings of ELF interactions ...


The multimodal nature of spoken word processing in the visual world: Testing the predictions of alternative models of multimodal integration

Ambiguity in natural language is ubiquitous (Piantadosi, Tily & Gibson, 2012), yet spoken communication is effective due to integration of information carried in the speech signal with information available in the surrounding multimodal landscape. However, current cognitive models of spoken word recognition and comprehension are underspecified with respect to when and how multimodal information...


A Multimodal Approach toward Teaching for Transfer: A Case of Team-Teaching in ESAP Writing Courses

This paper presents a detailed examination of learning transfer from an English for Specific Academic Purposes course to authentic discipline-specific writing tasks. To enhance transfer practices, a new approach in planning writing tasks and materials selection was developed. Concerning the conventions of studies in learning transfer that acknowledge different learning preferences, the instruct...


Multimodal signal processing in naturalistic noisy environments

When a system must process spoken language in natural environments that involve different types and levels of noise, the problem of supporting robust recognition is a very difficult one. In the present studies, over 2,600 multimodal utterances were collected during both mobile and stationary use of a multimodal pen/voice system. The results confirmed that multimodal signal processing supports s...


Deep learning: from speech recognition to language and multimodal processing

APSIPA Transactions on Signal and Information Processing / Volume 5 / 2016 / e1 DOI: 10.1017/atsip.2015.22, Published online: 19 January 2016 Link to this article: http://journals.cambridge.org/abstract_S2048770315000220 How to cite this article: Li Deng (2016). Deep learning: from speech recognition to language and multimodal processing. APSIPA Transactions on Signal and Information Processing...




Publication date: 1998
